Website SEO Audit Crawler — On-Page Analyzer
Pricing
from $5.00 / 1,000 results
Website SEO Audit Crawler — On-Page Analyzer
Crawl a whole website and export an on-page SEO audit to CSV/JSON: meta tag checker, canonical & noindex checker, schema.org type extractor. No browser.
Pricing
from $5.00 / 1,000 results
Rating
0.0
(0)
Developer
Logiover
Maintained by CommunityActor stats
0
Bookmarked
34
Total users
21
Monthly active users
2 days ago
Last modified
Categories
Share
Website SEO Audit Crawler — On-Page SEO Analyzer & Spider 🔍
Audit the on-page SEO of an entire website in one run. This SEO crawler / SEO spider crawls a whole site and runs a full on-page SEO audit on every page — titles, meta descriptions, headings, canonical tags, robots/indexability, Open Graph, schema.org types, word count, image alt coverage and internal/external link counts — plus a ready-made list of detected SEO issues.
Point it at one URL and it audits thousands of pages automatically. No login, no headless browser, one row per page.
Looking for a technical SEO audit tool, on-page SEO checker, meta tag checker, title tag checker or a bulk website SEO analyzer? This actor does the full crawl-and-audit at scale.
✨ Key features
- 🕷️ Full-site crawl — start from one URL and follow internal links across the whole domain.
- 🏷️ Complete on-page data — title, meta description, H1/H2, canonical, robots, Open Graph, Twitter cards, hreflang, charset, viewport.
- 🧱 Structured data detection — extracts schema.org
@types from JSON-LD. - 🚩 Automatic issue flags — missing titles, duplicate/multiple H1, thin content, missing meta, missing canonical, noindex, images without alt, and more.
- 🔢 Quality metrics — honest word count (boilerplate removed), image alt coverage, internal/external link counts.
- ⚡ Fast & cheap — pure HTTP, no browser, high concurrency.
💡 Use cases
- Technical SEO audits — surface on-page issues across a whole site in a single run.
- Content audits — spot thin content, missing meta descriptions and duplicate titles.
- Pre-launch QA — verify titles, canonicals and indexability before going live.
- SEO monitoring — schedule the crawl and track on-page SEO health over time.
- Migration checks — confirm meta, canonicals and indexability survived a redesign.
📦 What you get
One row per page, including:
| Field | Description |
|---|---|
url, statusCode | Page URL and HTTP status |
title, titleLength | Title tag and its length |
metaDescription, metaDescriptionLength | Meta description and length |
h1, h1Count, h2Count | Headings |
canonical, metaRobots, isIndexable | Indexability signals |
lang, charset, hasViewport, hreflangCount | Technical tags |
wordCount | Body word count (boilerplate removed) |
imagesCount, imagesMissingAlt | Image alt coverage |
internalLinks, externalLinks | Link counts |
ogTitle, ogDescription, ogImage, twitterCard | Social tags |
schemaTypes | Detected schema.org @types |
issues | Detected on-page SEO problems |
Detected issues
missing-title, title-too-long, title-too-short, missing-meta-description, meta-description-too-long, meta-description-too-short, missing-h1, multiple-h1, missing-canonical, noindex, images-missing-alt, missing-viewport, thin-content.
Example output
{"url": "https://example.com/blog/post","statusCode": 200,"title": "How to Do On-Page SEO","titleLength": 22,"metaDescriptionLength": 0,"h1": ["How to Do On-Page SEO"],"h1Count": 1,"wordCount": 1240,"imagesMissingAlt": 3,"internalLinks": 41,"externalLinks": 6,"schemaTypes": ["Article", "BreadcrumbList"],"issues": ["missing-meta-description", "images-missing-alt"]}
🚀 How to use it
- Click Try for free / Start.
- Paste one or more website URLs into Start URLs.
- (Optional) Set Max pages to crawl —
0to audit the whole site. - Click Save & Start.
- Export the audit as JSON, CSV, Excel or via API, and sort/filter by the
issuescolumn.
⚙️ Input
| Option | Description | Default |
|---|---|---|
startUrls | Websites to crawl | – (required) |
maxPagesToCrawl | Max pages per run (0 = whole site) | 1000 |
maxConcurrency | Parallel requests | 10 |
Example input
{"startUrls": [{ "url": "https://example.com" }],"maxPagesToCrawl": 5000}
🔍 How it works
The crawler follows internal links within the same domain as your Start URLs and analyzes the server-rendered HTML of each page: it reads all on-page SEO tags, counts headings/links/images, computes a boilerplate-free word count, detects schema.org types, and flags common SEO issues. Pure HTTP — fast and cheap, no headless browser.
🧰 Tips & best practices
- Sort or filter by the
issuesfield to build a prioritized SEO fix-list. - Use
titleLength/metaDescriptionLengthto catch tags that will be truncated in search results. - Filter
isIndexable: falseto find pages accidentally set tonoindex. - Set
maxPagesToCrawlto0for a complete site audit; schedule it to monitor SEO health.
❓ FAQ
Does it render JavaScript? No — it audits server-rendered HTML, which is what search engines index first, and keeps runs fast and cheap.
Does it detect structured data? Yes — it extracts schema.org @types from JSON-LD on each page.
How is wordCount calculated? From the body text after removing scripts, styles, nav, header, footer and sidebars — an honest content word count.
How do I audit the whole site? Set maxPagesToCrawl to 0.
What formats can I export? JSON, CSV, Excel, HTML and a full REST API.
How do I export an on-page SEO audit to CSV or JSON?
Run the crawl, then download the dataset as CSV, JSON or Excel (or pull it via the REST API). Every audited page is one row, so it drops straight into a spreadsheet.
Can I crawl a website for SEO without a browser?
Yes — this is a pure-HTTP SEO crawler with no headless browser, so it audits server-rendered HTML fast and cheap across thousands of pages.
Is there a site-wide meta tag and canonical checker?
Yes — it works as a bulk meta tag checker and a noindex/canonical checker across the whole site, flagging missing titles, missing meta descriptions, multiple H1s and noindex pages.
Does it extract schema.org structured-data types?
Yes — it acts as a schema.org type extractor, pulling every JSON-LD @type it finds on each crawled page.
🔗 Related actors by the same author
- Broken Link Checker — crawl a site and find dead links with HTTP status codes.
- Website Link Graph & Outbound Links Crawler — map internal & outbound links.
- JSON-LD Schema & Meta Tag Extractor — deep structured-data & meta extraction per URL.
- Sitemap to URL Crawler — extract all URLs from any sitemap.xml.
📝 Changelog
2026-06-15
- Reliability pass: re-verified end-to-end on live data with real-world inputs. Routine maintenance build.
2026-06-07
- Docs: added coverage for exporting an on-page SEO audit to CSV/JSON, crawling a site without a browser, and site-wide meta tag / canonical / schema.org type checking.
2026-06-05
- 🛡️ Reliability fix: results are no longer dropped by strict output validation — runs now complete cleanly even at high volume (thousands of results).
- ⚡ Stability & performance hardening; fresh rebuild.
2026-06-04
- Verified live & refreshed build — reliability/maintenance pass.